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Abstract 

In this paper, we address the problem of identifying linear structural equation models. We first extend 
the edge set half-trek criterion to cover a broader class of models. We then show that any semi-Markovian 
linear model can be recursively decomposed into simpler sub-models, resulting in improved identification 
power. Finally, we show that, unlike the existing methods developed for linear models, the resulting 
method subsumes the identification algorithm of non-parametric models. 


1 Introduction 


Many researchers, particularly in economics, psychology, and the social sciences, use linear structural equa¬ 
tion models (SEMs) to describe the causal and statistical relationships between a set of variables, predict 
the effects of interventions and policies, and to estimate parameters of interest. When modeling using lin¬ 
ear SEMs, researchers typically specify the causal structure (i.e. exclusion restrictions and independence 
restrictions between error terms) from domain knowledge, leaving the structural coefficients (representing 
the strength of the causal relationships) as free parameters to be estimated from data. If these coefficients 
are known, then total effects, direct effects, and counterfactuals can be computed from them directly ( |Pearl| 


2009; Chen and Pearl 2014J. However, in some cases, the causal assumptions embedded in the model are 


not enough to uniquely determine one or more coefficients from the probability distribution, and therefore, 
cannot be estimated using data. In such cases, we say that the coefficient is not identified or not identifiable 
Many SEM researchers determine the identifiability of the model by submitting the specification and 
data to software, which attempts to estimate the coefficients by minimizing a fitting function (Kenny and 


Milan 


2012). If the model is not identified, then the program will be unable to complete the estimation and 


warns that the model may not be identified. While convenient, there are disadvantages to using typical SEM 


software to determine model identifiability (Kenny and Milan 2012 Chen and Pearl, 2014). If poor starting 


values are chosen, the program could mistakenly conclude the model is not identified when in fact it may 
be identified. When the model is not identified, the program is not helpful in indicating which parameters 
are not identified nor are they able to provide estimates for identifiable coefficient^] Most importantly, the 
program only gives an answer after the researcher has taken the time to collect data. 

Rather than determining the identifiability of parameters by attempting to fit the model, we will detect 
identifiability directly from the model specification and express identified parameters in terms of the popula¬ 
tion covariance matrix. As a result, the modeler can estimate their values, even when the model, as a whole, 
is not identified. Additionally, we avoid issues of poor starting values and can determine the identifiability 
of parameters prior to collecting data. 

As compact and transparent representations of the model’s structure, causal graphs provide a convenient 
tool to aid in the identification of coefficients. First utilized as a causal inference tool by Wright (1921), 


graphs have more recently been applied to identify causal effects in non-parametric causal models (Pearl 


1 We will also use the term “identified” with respect to individual variables and the model as a whole. When the structural 
equation for a given variable contains one or more unidentified coefficients then we say the variable is unidentified. Similarly, 
when a model contains on e or more u niden tified coefficients than the model is not identified. 

2 According to Kenny an d Milan] (2012), AMOS is the only program that attempts to estimate parameters when the model 
is not identified. 
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2000) and enabled the development of causal effect identification algorithms that are complete for non- 


parametric models (Tian, 2002 Huang and Valtorta 2006 Shpitser and Pearl 2006). These algorithms can 
be applied to the identification of coefficients in linear SEMs by identifying non-parametric direct effects, 
which are closely related to structural coefficients (Tian, 2005; Chen and Pearl, 2014). Algorithms designed 
specifically for the identification of linear SEMs were developed by Brito and Pearl (2002), 

Tian (2005 2007, 2009), Foygel et al. (2012), and Chen et al. ( 2014| ). 


Brito (2004), 


Surprisingly, none of the aforementioned identification methods subsume the algorithms designed for non- 
parametric models when naively applied to the identification of linear SEM^J despite the added assumption 
of linearity. In this paper, we first extend the half-trek identification methods of |Foygel et al. (2012) and Chen 


et al. 


(2014). Our extensions can be applied to both semi-Markovian and non-Markovian models. We then 
demonstrate how recursive c-component decomposition, which was first utilized in identification algorithms 


for semi-Markovian non-parametric models (Tian, 2002 Huang and Valtorta, 2006 Shpitser and Pearl 


2006), can be utilized for semi-Markovian linear models. We show that applying recursive decomposition 


to our identification method allows the identification of additional coefficients and models. Further, we will 
demonstrate that this method subsumes the aforementioned non-parametric identification methods when 
applied to linear SEMs. Lastly, we note that our method is not complete and there are identifiable coefficients 
in certain models that cannot be identified using the methods given. 


2 Preliminaries 

A linear structural equation model consists of a set of equations of the form, 

X = AX + e, 

where X = [oq, ...,x n ] t is a vector containing the model variables, A is a matrix containing the coefficients 
of the model, which convey the strength of the causal relationships, and e = [e \,..., e n ] t is a vector of error 
terms, which represents omitted or latent variables. The matrix A contains zeroes on the diagonal, and 
A ij = 0 whenever xi is not a cause of Xj. The error terms are random variables and induce the probability 
distribution over the model variables. The covariance matrix of X will be denoted by E and the covariance 
matrix over the error terms, e, by Q. 

An instantiation of a model M is an assignment of values to the model parameters (i.e. A and the 
non-zero elements of 11). For a given instantiation m*, let E(m*) denote the covariance matrix implied by 
the model and Afe(m*) be the value of coefficient Xj. In this paper, we identify the model coefficients from 
the covariance matrix, E. 

Definition 1. A coefficient, A k, is identified if Xkfnrii) = A k{ m j) whenever E (ra^) = S (rrij). 

In other words, A k is identified if it can be uniquely determined from the covariance matrix, E. 

The causal graph or path diagram of an SEM is a graph, G = (V, D , B ), where V are vertices or nodes, D 
directed edges, and B bidirected edges. The vertices represent model variables. Directed eges represent the 
direction of causality, and for each coefficient A^ 7 ^ 0, an edge is drawn from xi to Xj. Each directed edge, 
therefore, is associated with a coefficient in the SEM, which we will often refer to as its structural coefficient. 
The error terms, e*, are not represented in the graph. However, a bidirected edge between two variables 
indicates that their corresponding error terms may be statistically dependent while the lack of a bidirected 
edge indicates that the error terms are independent. When the causal graph is acyclic without bidirected 
edges, then we say that the model is Markovian. Graphs with bidirected edges are non-Markovian , while 
acyclic graphs with bidirected edges are additionally called semi-Markovian. 

If a directed edge, called (x, y), exists from x to y then we say that x is a parent of y. The set of parents 
of y is denoted Pa(y). Additionally, we call y the head of (x,y) and x the tail. The set of tails for a set 
of directed edges, E , is denoted Ta(E) while the set of heads is denoted He(E). For a node, v, the set of 
edges for which He(E) — v is denoted Inc(v). If there exists a path of directed edges from x to y then we 
say that x is an ancestor of y. The set of ancestors of y is denoted Anc(y). If x is an ancestor of y , then y 
is a descendant of x. The set of descendants of x is denoted De(x). Finally, the set of nodes connected to y 
by a bidirected arc are called the siblings of y or Sib{y). 

3 The coefficient / in Figure [2a| is identified using non-parametric algorithms but not the aforementioned linear algorithms. 
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A path from x to y is a sequence of edges connecting the two vertices. A path may go either along or 
against the direction of the edges. A non-endpoint vertex w on a path is said to be a collider if the edges 
preceding and following w on the path both point to w, that is, —)> u; G-, u; G-, -4^0, or w; ff. A 
vertex that is not a collider is a non-collider. 

A path between x and y is said to be unblocked given a set Z (possibly empty), with x,y £ Z if 


1. every noncollider on the path is not in Z and 


2 . every collider on the path is in An(Z) (Pearl, 2009). 


3 Extending the Edge Set Half-Trek Criterion 


The half-trek criterion is a graphical condition that can be used to determine the identifiability of recursive 


and non-recursive linear models (Foygel et al., 2012 ). Foygel et al. ( 2012 ) use the half-trek criterion to identify 


the model variables one at a time, where each identified variable may be able to aid in the identification of 
other variables. If any variable is not identifiable using the half-trek criterion, then their algorithm returns 
that the model is not HTC-identifiable. Otherwise the algorithm returns that the model is identifiable. 
Their algorithm subsumes the earlier methods of Brito and Pearl ( | 2002 ) and |Brito (2004). Chen et al. 
(2014) modified the half-trek criterion (calling it the edge set half-trek criterion ) to identify connected edge 
sets. Rather than attempting to identify all of a variable’s coefficients at once, they instead partition the 
coefficients according to connected edge sets. As a result, if one coefficient is not identified, only its connected 
edge set will not be identified rather than the entire variable. In this way, they increase the granularity of 
the criterion to identify additional coefficients in unidentifiable models. In this section, we will paraphrase 


some preliminary definitions from (Foygel et al. 2012 ) and further extend the edge set half-trek criterion. 


Without c-component decomposition, this extension will allow identification of even more coefficients in 
unidentifiable models. Coupled with recursive c-component decomposition, our extended half-trek criterion 
allows the identification of additional models, as well. 


3.1 Preliminary Definitions 

We begin by establishing a couple preliminary definitions around half-treks. These definitions and illustrative 


examples can also be found in Foygel et al. ( 2012 ) and Chen et al. (2014). 


Definition 2. {Foygel eta l, 2012) A half-trek, tt, from x to y is a path from x to y that either begins with 
a bidirected arc and then continues with directed edges towards y or is simply a directed path from x to y. 

We will denote the set of nodes connected to a node, v, via half-treks htr(v). 


Definition 3. (Foygel et al, 2012 [) For any half-trek, tt, let Right( 7 r) be the set of vertices in n that have an 
outgoing directed edge in it (as opposed to bidirected edge) union the last vertex in the trek. In other words, 
if the trek is a directed path then every vertex in the path is a member of Right (it). If the trek begins with a 
bidirected edge then every vertex other than the first vertex is a member of Right (tt). 


Definition 4. (Foygel et al ., 2012 1) A system of half-treks, tt\, ..., 7 r n , has no sided intersection if for all 
iTi,7Tj G {tti, ..., 7r n } such that TTi ^ tt j , Right (tt i )C\Right (tt j )= 0. 


Definition 5. {Chen et al. , 2014) For an arbitrary variable, V, let Pai,Pa 2 ,..., Pa& be the unique partition 
of Pa(V) such that any two parents are placed in the same subset, Pai, whenever they are connected by 
an unblocked path. A connected edge set with head V is a set of directed edges from Pai to V for some 
i G {1,2,,., k}. 


3.2 General Half-Trek Criterion 

Having established the preliminary definitions in the previous subsection, we now give our extension of the 
edge set half-trek criterion. 
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Definition 6. (General Half-Trek Criterion) Let E be a set of directed edges sharing a single head v. A set 
of variables Y satisfies the general half-trek criterion with respect to E, if 

(i) \Y\ = \E\, 

(ii) Y n(v U Sib(v )) = 0, 

(in) There is a system of half-treks with no sided intersection from Y to Ta(E), and 
(iv) (Pa(v) \ Ta(E )) D htr(Y) = 0. 

A set of directed edges, E , sharing a head v is identifiable if there exists a set, Ye, that satisfies the 
general half-trek criterion (g-HTC) with respect to E, and Ye consists only of “allowed” nodes. Intuitively, 
a node, y , is not allowed if 

(i) it is half-trek reachable from He(E) and the coefficients of y that he on the half-treks from He(E) are 
not identifiable or 

(ii) y is connected to Pa(v) \Ta(E ), and the coefficients of y that he on unblocked paths between Pa(v ) \ 
Ta(E) are not identifiable. 

Otherwise, y is allowed. The following definition formalizes this notion. 

Definition 7. Let E be the set of directed edges in a causal graph, G. We say that a node, z, is g-HT-allowed 
(or simply allowed ) for directed edges E v C E with head v if 

(i) (a) z is not half-trek reachable from v and 
(b) z is not connected to Pa(v ) \ Ta(E) 

OR 

(ii) there exists a partition of E, ES, such that E v G ES, an ordering on ES, -<, and a family of subsets 
(Y Ei ), one subset for each Ei -< E v , such that Y Ei satisfies the g-HTC with respect to Ei and Ej -< E k 
for (Ej , Eh -< E v ) whenever 

(a) He(Ej) C htr(He(E k )) C\Y Ek or 

(b) Y Ek is connected to Pa(He(E k )) \ Ta(E k ) via edges in the set Inc(Y Ek ) fi Ej 

and any directed edges belonging to z that lie on a half-trek from v to z or lie on a path between z and 
Pa(He(E k )) \ Ta(E k ) belong to a set E z C ES ordered before E v . 

If Y is a set of allowed variables for E v that satisfies the half-trek criterion with respect to E v , we will 
say that Y is an g-HT-admissible set for E v . We are now ready to use the g-HTC to identify coefficients. 

Theorem 1. If a g-HT-admissible set for directed edges E v with headv exists then E v is identifiable. Further, 
let Y Ev = {yi, ...,yu} be a g-HT-admissible set for E v , Ta(E v ) = {pi, ...,p k }, and E be the covariance matrix 
of the model variables. Define A as 

[(/ — A ) T Y\ yiPj , yi G htr(v ) or yi connected 
Ay = < to Pa(v) \ Ta(E v ), (1) 

Viihtriy) 

and b as 

{ [(/ - A) T S]j /itI , yi e htr(v ) or y* connected 

to Pa(v) \Ta(E v ), (2) 

X yiV , Vi i htr(v) 

Then A is an invertible matrix and A • A Ta (E,,).y = b. 
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Figure 1 : (a) b is identified using the g-HTC but not the edge set HTC (b) a is identified using the g-HTC 
but not the edge set HTC 


Proof. The proof for this theorem is similar to the proof of Theorem 1 in Foygel et al. (2012). Rather than 


giving a complete proof, we simply explain why our changes are valid. The g-HTC identifies arbitrary sets 
of directed edges belonging to a node rather than all of the directed edges belonging to a node. It is able 
to do this because of two changes. First, sets that contain nodes that are connected to Pa(v ) \ Ta(E ) via 
half-treks cannot be half-trek admissible for E (see Definition 6 ). As a result, the paths from half-trek 
admissible set, Y#, to v travel only through coefficients of E and no other coefficients of E. This ensures 
that A • A Ta(E),v — b. Second, nodes that are connected to Pa(y) \ Ta(E) are not allowed unless their 
coefficients that he on paths to Pa(v ) \ Ta(E ) are identified. Likewise, nodes that are half-trek reachable 
from v are not allowed unless their coefficients that he on the half-treks from v are identified. This ensures 
that A and b are computable. Other coefficients need not be identified because they will vanish from A and 


b during the computations, ((/ — A)- E,) yiPj and ((/ — A) T • Y) ViV , due to zeroes in the matrix E. 


□ 


The g-HTC impoves upon the edge set HTC because subsets of connected edge sets may be identifiable 
even when the connected edge set as a whole is not. 


Theorem 2. Any coefficient identifiable using the edge set HTC is also identifiable using the g-HTC. More¬ 
over, the g-HTC is able to identify coefficients in certain models that the edge set HTC is not. 

Proof. First, we note that when the set of edges in question is a connected edge set, then the edge set HTC 
and the g-HTC are equivalent. As a result, any set of edges identifiable using the edge set HTC is identifiable 
using the g-HTC, and we have proven the first part of the Theorem. 

We will now use Figures [la| and [lb] to show that certain coefficients that are not identifiable using the edge 
set HTC are identifiable using the g-HTC. The edge set HTC identifies coefficients associated with connected 
edge sets altogether. As a result, the coefficient b in the graph depicted in Figure [Ta| is not identifiable using 
the edge set HTC since it belongs to the connected edge set {V 2 V 3 , V 4 —>> V 3 } and the coefficient d is not 

identifiable. However, b is identifiable using the g-HTC since V\ is g-HT-admissible for b. 

Additionally, coefficient a in Figure [lb] is not identifiable using the edge set half-trek criterion due to 
the fact that c is not identifiable. Using the g-HTC and Theorem [l] we can first identify b since V 2 is a 
g-HT-admissible set for b. Once b is identified, we can use Y a = V\ to identify a. □ 


3.3 g-HTC Algorithm 


Finding a g-HT-admissible set for directed edges, E , with head, u, fro m a set of allo wed nodes, 
can be accomplished by utilizing the max-flow algorithm described in Chen et al. (2014jp 


Ae, 

which we call 

MaxFlow(G/(£’, A)). This algorithm returns a maximal set of allowed nodes that satisfies (ii) - (iv) of the 


g-HTC. 

In some cases, there may be no g-HT-admissible set for E but there may be one for E C E . In other 
cases, there may be no g-HT admissible set of variables for a set of edges E but there may be a g-HT 
admissible set of variables for E with E C E . As a result, if a HT-admissible set does not exist for E v , 
where E v = Inc(v ) for some node v, we may have to check whether such a set exists for all possible subsets 
of E v in order to identify as many coefficients in E v as possible. This process can be somewhat simplified 


4 Brito| (2004| utilized a similar max-flow construction in his identification algorithm. This was later modified by |Foygel] 
et al. ( |2012 > for the half-trek criterion and finally Chen et al. (2014|) for the edge set half-trek criterion. 
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(c) ( d ) 


Figure 2 : (a) The graph is not identified using the g-HTC and cannot be decomposed (b) After removing 
Vq we are able to decompose the graph (c) Graph for c-component, {V 2 , V 3 , V 5 } (d) Graph for c-component, 

{v u v 4 } 


by noting that if E is a connected edge set with no g-HT-admissible set, then there is no superset E' with 
a g-HT-admissible set. 

Algorithm [l] utilizes the g-HTC and Theorem [l] to identify as many coefficients in a recursive or non¬ 
recursive linear SEM as possible. It iterates through every unidentified connected edge set in the graph, 
attempting to identify each unidentified subset of its edges using Theorem[l] For a given set of directed edges, 
E, the initial set of HT-allowed nodes is the set of nodes that is not half-trek reachable from He{E). As more 
and more edges are identified, the set of allowed nodes increases so that a set of edges that was not initially 
identified may become identifiable as the algorithm iterates. We define Allowed(E, IDEdges, G) to be the set 
of nodes that are either (i) not half-trek reachable from He(E) and not connected to Pa(He(E)) \ Ta(E) 
or (ii) their edges that lie on half-treks from He{E) or paths to Pa(He(E)) \ Ta(E) are in IDEdges. As a 
result, Algorithm [l] attempts to identify each connected edge set (and its subsets) until all coefficients have 
been identified or no new coefficients are identified in a given iteration. 


Algorithm 1 HT-ID(G, E, IDEdges) 

Initialize: EdgeSets <— all connected edge sets in G 

repeat 

for each ES in EdgeSets do 
v <- He(ES) 

for each E C ES such that E <f_ IDEdges do 
Ae Allowed(E, IDEdges, G) 

Ye <— MaxFlow(G/(u, Ae)) 
if |Ye| = \Ta(E)\ then 

Identify E using Theorem [l] 

IDEdges IDEdges U E 
end if 
end for 
end for 

until All coefficients have been identified or no coefficients have been identified in the last iteration 

return IDEdges 


4 Decomposition 


Tian (2005]) showed that the identification problem could be simplified in semi-Markovian linear structural 
equation models by decomposing the model into sub-models according to their c-components, defined below. 
Each coefficient is identifiable if and only if it is identifiable in the sub-model to which it belongs. |Foygel| 
et al. ( 2012 ) noted that first decomposing the model in this way and then applying the half-trek criterion 
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allows for the identification of a larger set of models. In this section, we will briefly review the results of 


Tian (2005) before showing that the c-component decomposition can be applied recursively to the model 
after marginalizing certain variables. This idea was first used to identify interventional distributions in 
non-parameteric models by Tian (2002) and Tian and Pearl (2002) and adapting this technique for linear 
models will allow us to identify models that the g-HTC, even coupled with (non-recursive) c-component 
decomposition, is unable to identify. Further, it ensures the identification of all coefficients identifiable using 
methods developed for non-parametric models-a guarantee that none of the existing methods developed for 
linear models satisfy. 


4.1 C-Component Decomposition 

Definition 8. ( Tian\ \ 200 5) The c-component of a node v in a causal graph, denoted C{y), is the set of 
nodes that is connected to v by paths consisting only of bidirected arcs. 


Tian 


(2005) showed that a coefficient is identified if and only if it is identified in the sub-graph consisting 

More formally, a coefficient from x to y is identified 


of its c-component and the parents of the c-component 
if and only if it is identified in the sub-model constructed in the following way: 


(i) The sub-model variables consist of C(y ) U Pa(C(y)). 

(ii) The structural equations for the variables in C(y ) are the same as their structural equations in the 
original model while the structural equations for the parents simply equate each parent with its error 
term. 

(iii) If the error terms of any two variables in the sub-model were uncorrelated in the original model then 
they are uncorrelated in the sub-model. 


When we decompose the model according to c-components, the distribution over the variables in the 
sub-model, W = S U Pa(S'), for a c-component S , is not simply the marginal distribution over W. Instead, 
the distribution, which we denote Ps , is computed from the joint distribution over the model variables, 
P(V). Letting V = {vi,V2, ...,t> n } be ordered topologically, we have 


p s (w)= n p^v^) n p(vi), 

Okes} 0|«,ePa(S)} 


(3) 


where V^~ ^ = {vi, v 2 ,..., fj-i}. 

For example, consider the graph depicted in Figure[2b] We have two c-components, Si = {(> 2 , v 3 , v~, } and 
S 2 = {^ 1 ,^ 4 }, with 


P Sl =P{v 2 1 Vi )P(v 3 \v 2 , Vi )P(v 5 \v 4 ,v 3 ,v 2 ,v 1 ) 
P(V 3 )P(V4) 


and 

p s 2 = P(vi)P(v 4 \v 3 ,v 2 ,v 1 )P(v 3 )P(v 2 )P(v 3 ). 


We will denote the sub-model for a c-component, S , Ms and the graph it induces Gs • Gs x is depicted 
in Figure [2c] and Gs 2 in Figure 2d 


Theorem 3 . {Tian , 2005) Let a variable Vj be in a c-component Sj in a SEM M . A coefficient, A ij, is 


identifiable if and only if it is identifiable in the model Ms ,. 


4.2 Recursive C-Component Decomposition 

The graph in Figure |2a| consists of a single c-component, and we are unable to decompose it. As a result, 
we are able to identify a but no other coefficients using the extended half-trek criterion. Moreover, / = 
■^E[v^\do(vQ, 174 , vs, V 2 , vi)] is identified using identification methods developed for non-parametric models 
(e.g. do-calculus) but not the g-HTC or other methods developed for linear models. 


7 


























However, if we remove Vf\ from the analysis, then the resulting model can be decomposed. Let M be the 


model depicted in Figure 2a P(v) be the distribution induced by M, and M be a model that is identical to 


M except the equation for vq is removed. M' induces the distribution f ve P(V)dve , and its associated graph 


2b 


G yields two c-components, as shown in Figure 

Now, decomposing G according to these c-components yields the sub-models depicted by Figures |2c| and 
2d Both of these sub-models are identifiable using the half-trek criterion. Thus, all coefficients other than 


ft have been shown to be identifiable. Returning to the graph prior to removal, depicted in Figure [2a| we 
are now able to identify h because both v 4 and V 5 are now allowed nodes for ft, and the model is identified 

As a result, we can improve our identification algorithm by recursively decomposing, using the extended 
half-trek criterion, and removing descendant set^J Note, however, that we must consider every descendant 
set for removal. It is possible that removing D\ will allow identification of a coefficient but removing a 
superset D 2 with D\ C D 2 will not. Additionally, it is possible that removing D 2 will allow identification 
but removing a subset D\ will not. 

After recursively decomposing the graph, if some of the removed variables were unidentified, we may be 
able to identify them by returning to the original graph prior to removal since we may have a larger set of 
allowed nodes. For example, we were able to identify ft in Figure [2a] by “un-removing” vq after the other 
coefficients were identified. In some cases, however, we may need to again recursively decompose and remove 
descendant sets. As a result, in order to fully exploit the powers of decomposition and the g-HTC, we must 
repeat the recursive decomposition process on the original model until all marginalized nodes are identified 
or no new coefficients are identified in an iteration. 

Algorithm [ 2 ] decomposes the graph according to its c-components and then applies HT-ID (Algorithm [l]) 
to each sub-model. If there are still unidentified coefficients, then it removes descendant sets and decomposes 
again. The whole process is repeated until every coefficient is identified or no new coefficients are identified 
in an iteration. Ep s . is the covariance matrix of P^., where Si is a c-component. Y, V \ D . is the covariance 
matrix after marginalizing Di from E. Finally, is the graph with the set Eft removed. 


Algorithm 2 Decomp-HT-ID(G, E) 

Initialize: IDEdges <— 0 
repeat 

IDEdges IDEdgesURec-Decomp(G, E, IDEdges) 
until All coefficients have been identified or no coefficients have been identified in the last iteration 
return IDEdges 

procedure Rec-Decomp(G, E, IDEdges)) 

V vertices in G 

Edges <— all edges in G 

for each c-component, Si. in G do 

IDEdges = IDEdgesU HT-ID(G 5 „ Es„ IDEdges) 
end for 

if IDEdges = Edges then 
Return IDEdges 
else 

for each descendant set, Dp in G do 

IDEdges <— IDEdgesURec-Decomp(Gy\p>., E^., IDEdges 

end for 
end if 

return IDEdges 
end procedure 

5 While V 4 and v$ are technically not allowed according to Definition [7] they can be used in g-HT-admissible sets to identify 
h using Theorem [T| since their coefficients have been identified. 

6 Only removing descendant sets have the ability to break up -components. For example, removing {^ 2 } from Figure [2a| does 
not break the c-component because removing V 2 would relegate its influence to the error term of its child, V3. As a result, the 
graph of the resulting model would include a bidirected arc between V 3 and vq, and we would still have a single c-component. 
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We now show that any direct effect identifiable using non-parametric methods is also identified using 
Algorithm [2] 


Theorem 4. Let M be a linear SEM with variables V. Let M' be a non-parametric SEM with identical 
structure to M. If the direct effect of x on x for x,y G V is identified in M then the coefficient A xy in M 
is identified using Algorithm Q 


Proof. Let G be the causal graph of M and M . Suppose the direct effect of x on y is identified in M . Then 
according to Theorem 3 of (jShpitser, 2008), there does not exist a subgraph of G that is a ^/-rooted c-tree 
(Shpitser 2 008| ). This implies that MACSfy) = y. By recursively decomposing the graph into c-components 
and marginalizing descendant sets, we can obtain a graph where only MACS(y ) and its parents remain in 
the graph. Since MACS(Y ) = y, the parents of y in this graph represent a g-HT admissible set that allows 
the identification of all coefficients of y. □ 


5 Conclusion 


In this paper, we extend the edge set half-trek criterion (Chen et al. 2014). We then incorporate recursive c- 
component decomposition (Tian, 2002), and show that the resulting identification method is able to identify 
more coefficients and models than the existing algorithms developed for both linear and non-parametric 
models. 
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